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ABSTRACT 

We have extended and improved the statistical test recently developed by Rauzy for 
assessing the completeness in apparent magnitude of magnitude-redshift surveys. Our 
improved test statistic retains the robust properties - specifically independence of 
the spatial distribution of galaxies within a survey - of the T c statistic introduced in 
Rauzy's seminal paper, but now accounts for the presence of both a faint and bright 
apparent magnitude limit. We demonstrate that a failure to include a bright magnitude 
limit can significantly affect the performance of Rauzy's T c statistic. Moreover, we have 
also introduced a new test statistic, T v , defined in terms of the cumulative distance 
distribution of galaxies within a redshift survey. These test statistics represent powerful 
tools for identifying and characterising systematic errors in magnitude-redshift data. 

We discuss the advantages of the T c and T v statistics over standard completeness 
tests, particularly the widely used V/V m ax test which assumes spatial homogeneity, 
and we demonstrate how our T v statistic can essentially be regarded as an improved, 
cumulative V/V max test which makes better use of the magnitude completeness in- 
formation in a redshift survey. Finally we apply our completeness test to three major 
redshift surveys: The Millennium Galaxy Catalogue (MGC), The Two Degree Field 
Galaxy Redshift Survey (2dFGRS), and the Sloan Digital Sky Survey (SDSS). We 
confirm that MGC and SDSS are complete up to the published (faint) apparent mag- 
nitude limit of rribj — 20.00 mag. and m r — 17.45 mag. respectively, indicating there 
are no residual systematic effects within the photometry. Furthermore, we show that, 
unless a bright limit is included for 2dFGRS, the data-set displays significant incom- 
pleteness at magnitudes brighter than the published limit of = 19.45 mag. 

Key words: Cosmology: methods: data analysis - methods: statistical - astronomical 
bases: miscellaneous - galaxies: redshift surveys - galaxies: large-scale structure of 
Universe. 



1 INTRODUCTION 

In recent years the statistical analysis of galaxy redshift sur- 
veys has become a thriving industry in cosmology, yielding 
powerful constraints on the parameters of both the underly- 
ing cosmological world model and on the luminosity distri- 
bution of galaxies as a function of redshift, environment and 
morphological type. However, both tasks are rendered com- 
plicated by the presence of observational selection effects 
- due to e.g. detection thresholds in apparent magnitude, 
colour, surface brightness or some combination thereof. Over 
many years, therefore, a wide range of statistical tools has 
been developed to identify, characterise - and hopefully to 
remove - the impact of observational selection effects from 
magnitude-redshift surveys. 

Of particular interest are data-sets which are complete 
in apparent magnitude - meaning that all galaxies brighter 
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than some specified limiting apparent magnitude (or, as is 
pertinent to this paper, with apparent magnitudes lying be- 
tween some specified bright and faint limiting values) have 
been observed. The case of magnitude-redshift data trun- 
cated by a faint apparent magnitude limit has been exten- 
sively discussed in the literature and well-established tech- 
niques have been developed for reconstructing the galaxy 
luminosity function in this case. Th e se inc lude, for exam- 
ple, the C— method of iLvnden-Belj lll97ll) . the maximum 
likelihood fitting method of ISandage etaU (I1979T) and the 
stepw ise maximum likelihood method of lEfstathiou et al] 
(1988). However, these methods are formulated under the 
assumption that the survey data are complete in apparent 
magnitude; hence, the assumption of magnitude complete- 
ness must be rigourously checked. 

A classical test for completeness in apparent magnitude 
is to analyse the variation in galaxy number counts as a func- 
tion of the adopted limiting apparent magnitude (Hubble 
1926). This test, which presupposes that the galaxy popu- 
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lation does not evolve with time and is homogeneously dis- 
tributed in space, is however not very efficient. More specif- 
ically, it is difficult to decide in practice whether deviations 
from the expected galaxy number count is indeed an ef- 
fect of incompleteness in apparent magnitude, or is due to 
galaxy clustering and/or evolution of the galaxy luminosity 
function, or is indeed created by incompleteness in apparent 
magnitude. Of course in designing a completeness test one 
can also make use of distance inform ation via galaxy red- 
shifts; the well-known V/V ma x test of ISchmidtl (|196a ) does 
this, and considers - for a specified magnitude limit - the 
ratio of two volumes: the volume of a sphere of radius equal 
to the actual distance of observed galaxy, divided by the vol- 
ume of a sphere of radius equal to the maximum distance at 
which the galaxy would be observable - i.e. at the apparent 
magnitude limit. It follows that - for a non-evolving, ho- 
mogeneous distribution of galaxies - the expected value of 
V/Vmax is equal to 1/2. The V/Vmax test has been used 
to assess the co mpleteness of magnitude- r edshif t samples 
(see for example iHudson and Lvnden-Belll (|l99ll ). but un- 
fortunately it suffers from the same major drawbacks as the 
Hubble test based on galaxy number counts: it is difficult to 
interpret whether any significant measured departure from 
the expected value of V/Vmax is due to incompleteness or 
to clustering and evolutionary effects. 

In an importa nt (although rarely cited) paper, 

lEfron and Petrosian! l| 19921 ) introduced a powerful new ap- 
proach to analysing magnitude-redshift surveys. They pro- 
posed a non-parametric test for the independence of the spa- 
tial and luminosity distributions of galaxies in a magnitude- 
limited sample, which required no assumptions concerning 
the parametric form of both the spatial distribution and 
the galaxy luminosity function. They applied this test to a 
quasar sample, with an assumed apparent magnitude limit, 
in order to robustly estimate the cosmological parameters 
characterising the lu minosity distance-redshift relation of 
the quasars (see a lso l|Efron and Petrosianlll998l ')'). 

iRauzvl (|200lh noted that the essential ideas of Efron 
& Petrosian could be straightforwardly adapted and ex- 
tended to turn their non-parametric test of the cosmolog- 
ical model into a non-parametric test of the assumption of 
a magnitude-limited sample - thus developing a simple but 
powerful tool for assessing the magnitude completeness of 
magnitude-redshift surveys. As was the case with Efron & 
Petrosian - and unlike the Hubble number counts or V/Vmax 
tests - the Rauzy test statistic requires no assumption that 
the spatial distribution of galaxies is homogenous. Moreover, 
it also requires no knowledge of the parametric form of the 
galaxy luminosity function. On the other hand, the Rauzy 
test was formulated only for the case of a sharp, faint ap- 
parent magnitude limit. The aim of this paper will be to 
extend Rauzy's formalism to account for a bright apparent 
magnitude limit, and to apply the extended completeness 
test to several recent galaxy redshift surveys. 

This paper is, therefore, organised as follows. In Sec- 
tion 2 we review the completeness test introduced bv lRauzvl 
(|200lh and extend it to the case of a galaxy survey with both 
a faint and a bright apparent magnitude limit. In Section 3 
we then introduce a further variant on the original Rauzy 
completeness test, which is based on the cumulative distance 
distribution of observable galaxies in a magnitude-redshift 
survey. In Section 4 we briefly describe the properties of 



three recent redshift surveys: the Millennium Galaxy Cat- 
alogue, the Sloan Digital Sky Survey and the Two Degree 
Field Galaxy Redshift Survey. In Section 5 we then apply 
our new completeness test to these three surveys, investi- 
gating their completeness in apparent magnitude. Finally, 
in Section 6 we summarise our conclusions. 



2 EXTENDING THE RAUZY 
COMPLETENESS TEST 

In this section we review briefly the robust completeness test 
introduced by Rau zy (2001; hereafter R01) and applied in 
iRauzv et al.l (|200ll ) , and extend it to include the effect of an 
imposed bright apparent magnitude limit to a magnitude- 
redshift survey. While this extension is straightforward, our 
approach in this paper will be rather pedagogical in order 
to benefit those readers not previously familiar with the for- 
malism previously developed in R01 and Rauzy, Hendry & 
D'Mellow (2001). 

2.1 Assumptions and statistical model 

The fundamental assumption of the Rauzy completeness test 
is that the luminosity function of the galaxy population does 
not depend on the 3-D redshift space position z = (z,l,b) 
of the galaxies. Although this assumption is restrictive, note 
that it is common to most classical number counts tests of 
completeness and indeed (when applied in the context of 
assessing magnitude completeness) the V/Vmax test. More- 
over, the results derived in Appendix 1 of Rauzy, Hendry 
& D'Mellow (2001) imply that the completeness test of R01 
remains valid in the case of pure density evolution. Note 
also that the completeness test remains valid for the case 
of pure luminosity evolution provided that an evolution cor- 
rection is applied to account for the (assumed known) func- 
tional dependence of the mean luminosity at given redshift. 
For clarity, we will defer until a subsequent paper the in- 
teresting case where one wishes simultaneously to assess the 
magnitude completeness and estimate the parameters of a 
luminosity evolution model. In this paper, where appropri- 
ate, we will apply evolutionary and extinction corrections 
from the literature to the photometry of the galaxy surveys 
we consider. Specifically, following the notation of R01, we 
introduce the corrected distance modulus Z, defined as 

Z = m- M = n(z) + k COI {z) + e cor (z) + A g (l, b), (1) 

where /j,(z) is the (cosmological model-dependent) distance 
modulus at redshift z, k COT (z) and e cor (z) are fe-corrections 
and evolutionary corrections respectively and A g (l,b) is 
an extinction correction dependent on galactic co-ordinates 
(l,b). Note, however, as will be seen in Section 5 below, 
that the application (or not) of fc-corrections and evolution- 
ary corrections generally does not have a strong impact on 
the performance of our completeness test. Neglecting for the 
moment any observational selection effects, the joint prob- 
ability density in position and absolute magnitude for the 
galaxy population can be written as 

dP zM <xdP z x dP M = p(z,l,b)dldbdz x f(M)dM, (2) 

where p(z, I, b) is the 3D redshift space distribution func- 
tion of the sources along the past light-cone and f(M) 
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Figure 1. Schematic diagram illustrating the construction of the rectangular regions Si and S2, defined for a typical galaxy at (Mi, Zi). 
The left panel shows the original R01 construction of regions Si and S2. The right hand panel shows the construction of regions Si 
and 52 with the inclusion of a bright limit. These regions are uniquely defined for a slice of fixed width, 8Z, in corrected distance 
modulus, and for 'trial' bright and faint apparent magnitude limits and respectively. Also shown are the true bright and faint 
apparent magnitude limits and "if im ; within which the rectangular regions Si and S2 contain a joint distribution of M and Z that 

is separable. 



is the galaxy lumin osity function, denned following e.g. 
iBinggeli et all (Il988l ). We now take as our null hypothesis 
that the selection effects are separable in position and ap- 
parent magnitude, and that the observed sample is complete 
in apparent magnitude for those objects which are simulta- 
neously brighter than a specified faint apparent magnitude 
limit, mf im , and fainter than a specified bright apparent 
magnitude limit, m nm . Under this null hypothesis the se- 
lection function ip(m, z, I, b) can be written as 

lj)(m, z, I, b) = #(m lim — m) x 6(m — m nm ) x (p(z, I, b), (3) 

where 9(x) is the Heaviside or 'step' function defined as 



9{x) 



if x > 0, 
if x < 0, 



(4) 



and 4>{z,l,b) describes the selection effects in angular posi- 
tion and observed redshift. Taking into account this model 
for the selection effects, the probability density function de- 
scribing the joint distribution of absolute magnitude M and 
corrected distance modulus Z for the observable galaxy pop- 
ulation may therefore be written as 



dP = h{Z)dZ f(M)dM 9(m{ im - m)6»(m - mf im ), 



(5) 



where h(Z) is the probability density function of Z for ob- 
servable galaxies, marginalised over direction on the sky, i.e. 



h(Z) = 



h(Z,l,b)dldb, 



(6) 



and the integrand h(Z,l,b) is equal to the (suitably nor- 
malised) product of the 3-D redshift space density p(z,l,b) 



and the selection function (j>(z,l,b), re-expressed as a func- 
tion of Z. 

Note from Equation ([5} that the faint and bright ap- 
parent magnitude limits introduce a correlation between the 
variables M and Z for observable galaxies. 



2.2 Defining the random variable ( 

As in ROf, the key element of our extended completeness 
test is the definition of a random variable, (, related to the 
cumulative luminosity function of the galaxy population. We 
proceed in a similar manner to R01, but now with both a 
bright and faint apparent magnitude limit. To see how we 
construct £ in this more general case consider Fig. 1, which 
schematically represents an M — Z plot of corrected distance 
modulus versus absolute magnitude for the observable pop- 
ulation of galaxies. The left hand panel shows this plot for 
the case of a faint apparent magnitude limit only, as was 
considered in ROT The right hand panel is for the more 
general case which we consider here. Shown on the right 
hand panel are solid diagonal lines representing the 'true' 
faint and bright apparent magnitude limits, m[ im and m\ m 
respectively, while the bold diagonal lines represent putative 
faint and bright magnitude limits, and respectively. 
The position (Mi,Zi) of the i th galaxy is also indicated on 
both panels, and the schematic diagrams are superimposed 
on the actual M — Z distribution for the Millennium Galaxy 
Catalogue (Driver et al. 2005; see below). 

In graphical terms, the essential idea of our complete- 
ness test is to identify from the data the faintest value of m f , 
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and the brightest value of mi which together bound a rect- 
angular region of the M — Z plane, within which the joint 
distribution of M and Z for observable galaxies is separable. 
If we compare the left and right hand panels of Fig. 1, we 
can see that the addition of a bright magnitude limit has an 
important impact on the construction of this separable re- 
gion: in short, the region is no longer unique. However, if we 
fix the width, 8Z, in corrected distance modulus as shown in 
the right hand panel of Fig. 1, the corresponding separable 
region is now uniquely defined. Moreover we can then define 
for the i th galaxy the following absolute magnitudes: 

• M\ iin (Zi), the absolute magnitude of a galaxy, at cor- 
rected distance modulus Zi, which would be observed at the 
true faint apparent magnitude limit mi im , 

• Mijjn^Zi — 8Z), the absolute magnitude of a galaxy, at 
corrected distance modulus Zi — 8Z, which would be ob- 
served at the true bright apparent magnitude limit m[| m . 

These two absolute magnitudes are indicated, for the puta- 
tive magnitude limits mi and mi, by the vertical dashed 
lines in the right hand panel of Fig. 1. 

We now define the random variable £ as follows 



c 



F(M) - F(M£ m (Z - SZ)) 



F(ML(Z))-F(M^JZ-5Z))' 
where F(M) is the Cumulative luminosity function, i.e. 

I'M 

F(M) = l f(x)dx. 



(J) 



(8) 



It is straightforward to show from this definition that 
the random variable £ has a uniform distribution on the in- 
terval [0, 1], and furthermore that £ and Z are statistically 
independent. Thus £ shares the same two defining properties 
as the corresponding random variable defined in R01. Equa- 
tion © therefore generalises the definition of £ to the case of 
a galaxy survey with bright and faint magnitude limits. The 
relevance of £ as a diagnostic of magnitude completeness will 
be demonstrated in the next two sections. 

2.3 Estimating £ and computing the T c statistic 

As was the case in R01, the random variable £ has the very 
useful property that we can estimate it without any prior 
knowledge of the cumulative luminosity function F(M). 
Given a value of SZ, it is clear from Fig. 1 that for each 
point (Mi, Zi) in the M — Z plane we can define the regions 
Si and 5*2 as follows: 

• Si = {(M, Z) : Af£ m < M < Mi, Zi- SZ < Z < Z{\, 

• S 2 = {(M, Z) : Mi < M ==: M& m , Z t -8Z^Z ^ Zi}. 

In the special case where there is no bright limit the regions 
Si and S2 are as shown in the left hand panel of Fig. 1. 

Clearly the random variables M and Z are now inde- 
pendent within each sub-sample Si and S2. Therefore from 
Equation © the number of points belonging to Si satis- 
fies 

h(Z')dZ' x / f(M)dM, (9) 

I Zi-SZ J M}>. 



where N ea ,\ is the total number of galaxies in the sample. 
Similarly the number of points m in Si — Si U S2 satisfies 



= / h(Z')dZ' x 



f(M) dM. 



(10) 



The integrals over absolute magnitude in Equations Q and 
(110[) may be rewritten as 



Mi 

and 

M. 1 ? 



f(M) dM = F (Mi(Zi)) - F (M& m (Zi)) , (11) 



f(M) dM = F (ML(Zi)) - F (ML(^)) • (12) 



Thus, given a pair of 'trial' magnitude limits m\ and mi, it 
follows from Equations (J7]l and ((9} to (|12|l that an estimate 
of £ for the i th galaxy is simply the ratio of the number of 
galaxies belonging to Si and Si U S2 respectively (where 
and Ml replace My, m and My m in the definition of Si and 
S2). In fact an unbiased estimate of Q for the i th galaxy is 
(c.f. R01) 

= r~TT- (13) 



rii + 1 



This estimator is identical to that defined in R01; the intro- 
duction of a bright magnitude limit has simply changed the 
definition of the random variable £ itself and the member- 
ship criteria of the two regions Si and S2. Thus, provided 
that both mi ^ nin m and m° t ^ m\ m , then under our null 
hypothesis Q will be uniformly distributed on [0, 1] and un- 
corrected with Zi, exactly as was the case in R01. Moreover 
the expectation value Ei and the variance Vt of the £i are 
given respectively by 



Ei = E(0) 



r, Vi = E 



(Ci-Ei)'' 



1 Hi - 1 



12 m + 1 



(14) 



Note that Vj tends towards the variance of a continuous 
uniform distribution between and 1 when n» is large. 

As in R01, we can, therefore, combine the estimator Q 
for each observed galaxy into a single statistic, T c , which we 
can use to test the magnitude completeness of our sample 
for adopted trial magnitude limits mi and mi . T c is defined 



T c = 



"gal 




(15) 



If the sample is complete in apparent magnitude, for a given 
pair of trial magnitude limits, then T c should be normally 
distributed with mean zero and variance unity. If, on the 
other hand, the trial faint (bright) magnitude limit is fainter 
(brighter) than the true limit, T c will become systematically 
negative, due to the systematic departure of the £j distribu- 
tion from uniform on the interval [0, 1]. 



3 TV: A VARIANT OF THE RAUZY 
COMPLETENESS TEST 

In Section 1 we remarked on the similarities between the 
V/Vmax test statistic and the completeness test of R01. We 
now introduce a further variant on the test statistic T c , re- 
lated to the distribution of corrected distance modulus for 
observable galaxies in a magnitude-redshift survey. 
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Figure 2. : Schematic diagram illustrating the construction of the rectangular regions 53 and 54, defined for a typical galaxy at (Mi, Zi), 
which feature in the estimation of our new completeness test statistic, T v . Panel (a) illustrates how 53 and 54 are constructed for a 
survey with only a faint magnitude limit "if im , and are shown for a trial faint limit m+ . Panel (b) illustrates the case where the survey 
also has a true bright limit Wi}?, and the rectangles are constructed for trial bright and faint limits m{| and m[ im respectively. Note 
that the construction of the rectangles is unique for a 'slice' of fixed width, SM, in absolute magnitude. 



3.1 Defining the random variable r 

Fig. [2]shows schematic M — Z plots, analogous to Fig. 1: the 
left panel again has a faint apparent magnitude limit only 
while the right hand has a 'true' bright and faint apparent 
magnitude limit, mS| m and rr% m , with 'putative' bright and 
faint limits, and m t respectively, shown as the bold diag- 
onal lines. Again, the position, (Mi,Zi), of a typical galaxy 
is shown on each panel, and the schematic plots are super- 
imposed on the actual M — Z distribution of the Millennium 
catalogue. 

In the right hand panel of Fig. [2] however, we consider 
a 'slice' of width 5M in absolute magnitude brighter than 
Mi, as shown. We see that, given m[| m , mf im and SM, we 
can define for the i th galaxy the following corrected distance 
moduli: 

• Z U pp(Mi), the corrected distance modulus of a galaxy, 
with absolute magnitude Mi, which would be observed at 
the true faint apparent magnitude limit m\ im , 

• Z\cm(Mi — 8M), the corrected distance modulus of a 
galaxy, with absolute magnitude Mi — SM, which would be 
observed at the true bright apparent magnitude limit mj^. 

These two limiting distance moduli are indicated, for the 
putative magnitude limits m* and m», by the horizontal 
dashed lines in Fig. 2. 

We now define a new random variable r as follows. Let 
H(Z) denote the cumulative distribution function of cor- 
rected distance modulus for observable galaxies, i.e. 

H(Z) = [ h(Z')dZ'. (16) 



Then r is defined as 

H(Z)-H(Z low (M-8M)) 
T H(Z» PP (M))-H(Z low (M-SM))' ( ' 

As was the case for the random variable (, it is straightfor- 
ward to show that r possesses the following properties: 

• PI: r is uniformly distributed between and 1, 

• P2: r and M are statistically independent. 

These two properties are exactly analogous to the defining 
properties of £ , except that r is now independent of corrected 
absolute magnitude, M. Once again we can use property PI 
to construct a test for completeness in apparent magnitude. 



3.2 Estimating r and computing the T v statistic 

Under the assumptions introduced in the previous section, 
it follows that r can be estimated from our observed data 
without any prior knowledge of the spatial distribution of 
galaxies. To see how this estimate is constructed, consider 
again the right hand panel of Fig. [5] For each point with 
co-ordinates (Mi, Zi) in the M — Z plane we can define the 
regions S3 and S4 as follows: 

• S3 = {(M, Z) : Mi - SM < M ^ M u Z lo „ ^Z^Z,}, 

• Si = {(M, Z):Mi- SM <: M ^ M h Z, < Z < Z l upp }. 

In the special case where there is no bright limit the 
regions S3 and 54 are as shown in the left hand panel of 
Fig. 1. Indeed, in this case, S3 is identical to the region Si 
shown on the left hand panel of Fig. 2. 
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As was the case in Section 2, we see that the random 
variables M and Z are independent in each sub-sample 5*3 
and 5*4. Therefore we can estimate r by counting the num- 
ber, Si, of galaxies that belong to S3 and the number, ti, of 
galaxies that belong to S3 US4. As in Section 2, an unbiased 
estimate of r is given by 

(18) 



U + 1 



m hmi 



Thus, provided that both m» ^ m[ im and raj 
then under our null hypothesis f, will be uniformly dis- 
tributed on [0, 1] and uncorrelated with Z%, exactly as for 
Q. Moreover the expectation Ei and variance Vj of the ti 
are respectively 



EA 2 ] = ± U 



12 U + 1 



(19) 



Again, the variance of f, tends towards that of a continuous 
uniform distribution between and 1 for large ti. 

We can, therefore, again combine the estimator fj for 
each observed galaxy into a single statistic, T v , which we 
can use to test the magnitude completeness of our sample 
for adopted trial magnitude limits ml and m*. T c is defined 



Jv gal 

r -E(*-i) 




(2(1) 



If the sample is complete in apparent magnitude, for a given 
pair of trial magnitude limits, then T v should be normally 
distributed with mean zero and variance unity. If, on the 
other hand, the trial faint (bright) magnitude limit is fainter 
(brighter) than the true limit, in either case T v will become 
systematically negative, due to the systematic departure of 
the ti distribution from uniform on the interval [0, 1]. 



4 THE DATA 

We will now apply the tools developed in the previous sec- 
tion to test the magnitude completeness of three major red- 
shift surveys: the Millennium Galaxy Catalogue (MGC), the 
Two Degree Field Galaxy Redshift Survey (2dFGRS) and 
The Sloan Digital Sky Survey (SDSS-DR1, Early Types). 
For ease of comparison we have assumed the same back- 
ground cosmological model but have applied existing selec- 
tion criteria (detailed below) for each survey. 



4.1 Cosmology 

Unless otherwise stated we have adopted throughout a 'Con- 
cordance' cosmology with present-day matter density Q m o = 
0.3 and cosmological constant term Qao = 0.7, and with a 
value of Ho = 100 kms -1 Mpc -1 for the Hubble Constant 
throughout. 

In order to convert the apparent magnitudes published 
for each survey to corrected absolute magnitudes we apply 
the following relation: 

Mi = raj — 51og 10 (dz, 4 )— 25— b)-k COI (zi)-e COI (zi), (21) 

where dh ( is the luminosity distance (in Mpc) of the i th 
galaxy given by: 



d Li = (1 + zt) (-pi = =, (22) 

^ M oJ J ^/(l + Zi ) 3 fi ro0 + Qao 

and the other terms are as defined in Section 2.1 above. 



4.2 Selection limits, fc-corrections and 
evolutionary corrections 

4.2.1 2dFGRS 

The 2dF Galaxy Redshift Survey measured redshifts using 
the multifibre spectrograph on the Anglo- Australian Tele- 
scope. We have used the 2dFGRS public final release data- 
set, from the 'best observations' spectroscopic catalogue, 
which records redshifts for a total of 245,591 sources. 

The corresponding photometry was taken from the 
APM galaxy catalogue (Maddox et al. 1990) for galaxies 
brighter than an apparent magnitude of nib = 19.45 mag. 
The 2dFGRS survey region covered two strips: one 75° x 10° 
around the north galactic pole and the other 80° x 15° 
around the southern galactic pole. 

To construct a clean catalogue, we firstly selected those 
galaxies with reliable redshifts - i.e all galaxies with a pub- 
lished redshift quality Q z > 3. We then imposed maximum 
and minimum limits in redshift following Cross et al. 2001 
- i.e. redshifts in the range 0.015 < 2 < 0.12. From the par- 
ent catalogue of 245,591 sources we use a total of 111,082 
galaxies. 

There have been a variety of fc-corrections and evolu- 
tionary corrections applied to the 2dFGRS. In ou r analy- 
sis we have ado pted those applied by ICrossI (120011 ) and by 
iNorberd (|2002al fbl). 



4-2.2 The SDSS early-type galaxy sample 

For our analysis we used galaxies selected from the Sloan 
Digital Sky Survey (SDSS) database. See Stoughton et al. 
(2002) for a description of the Early Data Release; Abaza- 
jian et al. (2003) et al. for a description of DR1, the First 
Data Release; Gunn et al. (1998) for a detailed description of 
the camera; Fukugita et al. (1996), Hogg et al. (2001) and 
Smith et al. (2002) for details of the photometric system 
and calibration; Lupton et al. (2001) for a discussion of the 
photometric data reduction pipeline; York et al. (2002) for a 
technical summary of the SDSS project; Pier et al. (2003) for 
the astrometric calibrations; Blanton et al. (2003) for details 
of the tilling algorithm; Strauss et al. (2002) and Eisenstein 
et al. (2001) for details of the target selection. 

In broad terms, the SDSS sample includes spectroscopic 
information as well as photometric measurements in the 
u*,g*,r*,i* and z* bands. The SDSS First Data Release 
covers an area of ~ 2000 deg 2 (Abazajian et al. 2003) on 
the sky. 

The main quantities used in this work are the abso- 
lute and apparent magnitudes, and redshifts present in the 
SDSS-First Data Release, early types only (hereinafter re- 
ferred to as 'SDSS-DR1' ). The selection criteria has been 
■ ' — I 1 1 

discussed in Bcrnardi et al. (2003) and the data-set com- 
piled in iBernardi et alj (120051 ) . 39,320 objects have been 
targeted as early-type galaxies and having dereddened Pet- 
rosian (hereinafter referred to as, nn r ) apparent magnitude 
(14.5 < m r < 17.75), and a redshift range of (0.0 < z < 
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0.4). To calculate the distance modulus we assume a Hub- 
ble constant of 70 km s _1 Mpc -1 . 

4-2.3 The Millennium Galaxy Catalogue 

The Millennium Galaxy Catalogue (MGC) is a medium- 
deep, Bj-band imaging survey, spanning 30.9 deg 2 that is 
fully contained within the 2dFGRS and SDSS-DR1. The 
full catalogue contains 10095 galaxies out to a published 
limiting apparent magnitude of mb J = 20.00 mag (e.g. see 
ICross et all (|2004T ) for more detail). The photometry was 
obtained with the Wide Field Camera on the 2.5 m Isaac 
Newton Telescope in La Palma. The spectroscopy was con- 
structed mainly from the redshifts obtained in the 2dFGRS 
and SDSS. In addition, the MGC team measured their own 
redshifts using the spectrograph on the Anglo- Australian 
Telescope for galaxies in the catalogue that had no assigned 
redshift. 

Similarly with the 2dFGRS catalogue, only galaxies of 
a redshift quality Q z ^2 3 have been selected. For ease 
of comparison we have imposed the same redshift cut as 
iDriver et alJ (|2005l l - i.e only galaxies in the range 0.013 < 
2 < 0.18 were included. From the parent catalogue of 10,095 
galaxies this yields a sub set of 7,878 galaxies. Where ap- 
propriate we have applied the fc-corre ctions and ev o lution ary 
corrections as described in detail bv lDriver et al. I (|2005l). 

5 RESULTS 

5.1 Testing the MGC dataset 

5.1.1 The Rauzy method with a faint limit only 

Fig. [3] shows the T c statistic as applied to the MGC survey. 
Since there was no bright limit published for this dataset we 
can use the traditional construction of the random variable Q 
as described in R01. The dashed curve shows the T c statistic, 
as a function of trial magnitude limit, computed using ap- 
parent magnitudes that have not been (fc + e)-corrected, but 
have been corrected for extinction only. The figure clearly 
shows that the T c statistic remains within the 3<r limits - 
consistent with being complete in apparent magnitude - up 
to the published magnitude limit of 20.0 mag., but then 
drops very sharply for trial apparent magnitude limits be- 
yond 20.0 mag. 

The solid curve on the same plot again shows the T c 
statistic as a function of trial magnitude limit but now com- 
puted for (k + e)-corrected apparent magnitudes. Although 
the MGC dataset is still consistent with being complete up 
to the published magnitude limit of 20.0 mag., there is a no- 
ticeable departure in the behaviour of T c from that for the 
uncorrected dataset: for trial magnitude limits in the range 
mb ~ 17.5 to mb — 20.00, T c for the corrected dataset 
exhibits a slow decline, before again dropping sharply be- 
yond 20.0 mag. The most likely explanation for this fea- 
ture seems to lie in the way the dataset is selected and cor- 
rected. The raw dataset, with uncorrected magnitudes, has 
the same magnitude limit imposed on all galaxies indepen- 
dent of their galaxy type. If, then, each galaxy is individually 
(k + e)-corrected, the resultant overall magnitude limit for 
the corrected data will become 'fuzzy' without a sharp cut- 
off. Furthermore, different galaxy populations will be scat- 
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Limiting apparent magnitude m + 

Figure 3. The T c statistic computed for MGC survey. We have 
selected galaxies in the range 0.015 < z < 0.12 with a quality 
Q z ~^ 3. The dashed curve is for all galaxies with no (k + e)- 
corrections applied. The solid curve shows how the T c computed 
after the application of the (k + e)-corrections of Driver et al. 
(2005). 



tered differently, leading to a smooth decrease close to the 
original uncorrected magnitude limit. On the other hand, 
if we do not apply a (k + e)-correction, the original mag- 
nitude limit remains defined (albeit now without explicitly 
accounting for the effects of evolution and redshifting of each 
galaxy's spectral energy distribution). Therefore, to obtain 
an improved measure of completeness which does properly 
incorporate a (k + e)-correction, one could apply ROBUST 
separately to subsets of different galaxy type. This would, in 
principle, lead to the definition of different apparent magni- 
tude limits for different galaxy types. In any event, it is clear 
from Fig. |3]that the impact on the inferred 'global' apparent 
magnitude limit of applying, or not, (A: + e)-corrections to 
the MGC dataset is small. 

5.1.2 Imposing a Bright Limit 

Having established that MGC is indeed complete up to the 
published faint magnitude limit of 20.0 mag., we can now 
use this survey to demonstrate how the introduction of a 
bright limit can affect the Rauzy completeness test, if not 
properly accounted for. 

To this end, we take the MGC data-set (with no (fc + e)- 
corrections applied) and introduce three, increasingly faint, 
artificial bright limits in apparent magnitude: nib- > 14, 
m,b > 15, and mj. > 16 respectively. Fig. U (left) shows 
the resulting T c curves for the data-sets with these artifi- 
cial bright limits, but where T c was computed assuming no 
bright limit. The plots clearly show that, as the bright limit 
is made progressively fainter, the computed value of T c devi- 
ates more strongly from the behaviour expected for a com- 
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Figure 4. The T c statistic computed for the MGC survey (without (k + e)-corrections) but now illustrating the effect, close to the faint 
limit, of imposing artificially a bright apparent magnitude limit on the selected galaxies. In the left panel the solid black curve shows 
T c computed assuming no bright limit (identical to the right hand portion of the dashed curve in Fig. 2) while the other three curves 
correspond to progressively fainter bright limits, at > 14, > 15 and mj,. > 16 respectively. For all four curves we calculated T c 
following Rauzy (2001) - i.e. assuming no bright limit. We can clearly see that the presence of a bright limit, if ignored, has a significant 
impact on the computed value of T c for faint magnitudes, and thus would adversely affect the assessment of magnitude completeness 
close to the faint limit. In the right hand panel we repeat our analysis for the same four cases as in the left panel, but now use our 
extended method which explicitly accounts for the presence of a bright limit. We can clearly see that the performance of T c is no longer 
adverseky affected, and a consistent estimate of the faint magnitude limit is obtained for different imposed bright limits. 



plete data-set. This trend is as expected: as can be seen in 
Fig. 1, the presence of the bright limit breaks the separa- 
bility of the M and Z distributions for observable galaxies. 
Hence, if the bright limit is ignored then the computed value 
of T c will be systematically biased. 

We now impose the same artificial bright limits as be- 
fore but apply our generalised T c method which accounts 
for a bright and faint limit [see Fig. [4] (right)]. It is evident 
from this plot that, even with a bright magnitude limit as 
faint as mt - = 16, the performance of the T c statistic at 
fainter magnitudes is largely unaffected, showing consistent 
behaviour for all the bright limits considered. 

5.2 Testing the SDSS-DR1 dataset 

As previously discussed, the SDSS data-set has both a pub- 
lished bright and faint apparent magnitude limit of m r = 
14.55 and m r — 17.45 mag. respectively. We, therefore, 
tested the completeness of the DR1 early type galaxies us- 
ing our generalised T c statistic which accounts for both a 
faint and bright limit. As an illustration, we chose to fix the 
bright limit of the SDSS data-set to be equal to the pub- 
lished value, and computed the T c statistic as a function of 
the trial faint magnitude limit. We could, alternatively, have 
fixed the value of faint magnitude limit and computed the 
value of T c for different trial values of the bright limit, or 
indeed we could have treated simultaneously both the faint 
and bright limits as free parameters. However, the case we 



present is sufficient to illustrate our method. Fig. [5] (left) 
shows the resulting T c curve. We see that our results are in 
agreement with the published faint magnitude limit - i.e. the 
behaviour of T c is consistent with magnitude completeness 
up to and including a sharp, faint limit of m r — 17.45 mag., 
followed immediately by the strongly negative behaviour ex- 
pected for an incomplete sample at fainter magnitudes. 

The right-hand panel of Fig.[5]shows the T c curve com- 
puted using the traditional Rauzy method with a faint limit 
only. Here the results show a similar trend to that seen for 
the MGC data-set with an artificially imposed bright limit 
section, and further underlines to importance of correctly ac- 
counting for a bright and faint limit when both are present 
in the data. In a future paper we will explore in more detail 
the completeness of not only the current SDSS data releases, 
but also how that completeness varies with bandpass. 

5.3 The 2dFGRS Survey 

We move finally to the 2dFGRS survey which, as discussed 
in the Data section, has a published faint limit of ra£ = 
19.45 mag. 

5.3.1 The Rauzy method with a faint limit only 

Our initial approach was to apply the traditional T c statistic 
to the 2dFGRS since the published literature on the survey 
gives no indication about the presence of a secondary bright 
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Figure 5. Performance of the T c statistic applied, as an illustrative example, to the SDSS-DR1 early-type elliptical galaxies. In the left 
panel we compute T c using our extended method, fixing the bright magnitude limit to equal the published value of 14.55 mag. We see 
that, in this case, the behaviour of T c is consistent with magnitude completeness up to and including the published faint limit of 17.45 
mag., but the statistic drops rapidly thereafter - indicating the sharp onset of magnitude incompletness. In the right hand panel, on the 
other hand, we compute T c following Rauzy (2001) — i.e. assuming a faint magnitude limit only. As anticipated, we see that the test 
statistic deviates very strongly from its expected value for a complete data-set at magnitudes which are much brighter than the published 
faint magnitude limit (although it is worth noting that T c still decreases even more rapidly once the published faint limit is exceeded). 



limit. Fig. [6] (left) shows the behaviour of T c as a function of 
trial magnitude limit, m{, for five different cases. The solid 
red curve represents the completeness test with with no k- 
or e-corrections applied. The remaining four curves show 
Tc with various (k + e)- corrections applied to the 2dFGRS 
data-set, as shown in the figure key. 

Consider firstly the uncorrected data-set (solid red 
curve). We see that for rre* < 14.85 mag. the T c statis- 
tic appears to behave as we would expect for a complete 
sample (although of course this is at the cost of 'throwing 
away' most of the galaxies in the survey by imposing such a 
low value for the faint limit). However, for higher values of 

the statistic drops dramatically to a minimum value of 
nearly 8a below its expectation value of zero at = 16.90 
mag. As we continue to increase m* , T c rises sharply to reach 
a peak at = 18.15 mag, beyond which the statistic drops 
dramatically again, exceeding 3a below its expected value 
at m* = 18.60 - i.e. significantly brighter than the published 
magnitude limit of mf im = 19.45 mag. 

Could the behaviour of T c be related to the fact that 
we have used an uncorrected data-set? To address this ques- 
tion consider now the remaining four curve s; the dotted 
and short dashed curves correspond to the I Cross! (|200ll ) 
and iNorberg et all |2001) global (k + e)-corrections respec- 
tively, whereas the long dashed and solid black cur ves corre- 
spond t o the type- dependent (k + e)-corrections of iNorbera 
(200230). It is clear that the adoption of any of these cor- 
rections has very little effect on the completeness statistic 
compared with the uncorrected case. Indeed, if anything, the 



addition of (k + e)-corrections appears to yield a T c statistic 
which is more strongly inconsistent with a complete sam- 
ple. This latter effect may be explicable in the same manner 
as for the corrected MGC magnitudes described in section 
15.11 although it should be noted that the type-dependent 
(k + e)-corrections do not appear to perform significantly 
better than their global counterparts. 

The fact that the value of T c differs from zero at many 
standard deviations over a wide range of trial faint magni- 
tude limits is clear evidence that the distribution of M and 
Z for observable galaxies is not separable with these faint 
limits. The physical cause for this is not immediately clear; 
however, having demonstrated in the previous subsection 
the adverse impact on T c of not correctly accounting for a 
bright magnitude limit, we next apply our generalised test 
statistic to the 2dFGRS data-set. 

5.3.2 2dFGRS with a bright limit 

In the absence of a clear indication from the literature of 
what is an appropriate bright magnitude limit, we adopted 
the brightest galaxy in our subset (see j)4.2.1|) . m\ irn = 13.60 
mag. The right hand plot of Fig. [5] shows the T c curve ob- 
tained for the 2dFGRS (with no k- or e- corrections applied) 
as a function of trial faint limit, ml, with m\ irn = 13.60 mag. 
We have used a 8Z — 0.8 (a small SZ leads to low numbers 
of galaxies within the subsets, SI and 5*2, making our test 
statistic prone to large statistical fluctuations and therefore 
less sensitive to a sharp cut in magnitude) . The plot demon- 
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2dFGRS 2dFGRS, bright limit 




Limiting apparent magnitude m. 

Figure 6. Performance of the T c statistic applied to our 2dFGRS sample. In the left hand panel we compute T c assuming only a faint 
magnitude limit, for both uncorrected magnitudes and after applying various different (k + e)-corrections. Note that several anomalous 
features are apparent, which are strongly discrepant from the behaviour of the test statistic expected for a complete sample. Moreover, 
T c begins to drop very sharply around m ~ 19.0 mag. - significantly brighter than the published faint magnitude limit of 19.45 mag. In 
the right hand panel we include the effect of a bright apparent magnitude limit, adopting for simplicity a value equal to the apparent 
magnitude of the brightest galaxy in our sample. The resulting T c curve (shown for uncorrected magnitudes and computed assuming 
a fixed 'slice' width of 8Z = 0.8 in distance modulus) is now entirely consistent with magnitude completeness up to and including the 
published faint limit, but drops very sharply at fainter magnitudes. 



> 
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Figure 7. Comparison of the T c and T v statistics computed for MGC (left panel), SDSS-DR1 (middle panel) and 2dFGRS (right panel). 
For the latter two surveys the same bright limits were adopted as in Figs. 5 and 6 respectively, and appropriate values for 8Z (for T c ) 
and 8M (for T v ) were chosen. Note the almost identical agreement of the test statistics in each case. To illustrate the robustness of 
this result, the MGC results are with (k + e)-corrections applied, the SDSS-DR1 results are with A"-corrections only applied, while the 
2dFGRS results are for uncorrected galaxy data. 
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strates a dramatic change in behaviour of the 2dF complete- 
ness, compared with the traditional T c statistic. By simply 
accounting for a bright limit - notwithstanding the fact that 
no published bright limit has been reported in the literature 
- we find that the 2dFGRS data-set is indeed complete to 
the published faint magnitude limit with no evidence for 
residual systematics. 



because evolution will, of course, break the separability of 
the underlying joint distribution of M and Z, i.e. the condi- 
tional distributions of M at given Z and Z at given M will 
no longer be simply equal to their marginal distributions. It 
seems likely, therefore, that an exploration of the systematic 
differences between T c and T v for deeper surveys may be an 
effective probe of evolution. 



6 APPLICATION OF THE T v STATISTIC 

In the previous sections we introduced and applied our im- 
proved T c statistic, which accounts for both a faint and 
bright magnitude limit in assessing the completeness of a 
magnitude-redshift survey. In this section we apply the T v 
statistic, introduced in Section 3 above, to the same data- 
sets. Our T v statistic can be thought of as an improved, 
differential, version of the classical V/Vmax test of galaxy 
evolution, which is generally presented in the literature as 
yielding a single number - the mean value of V/Vmax aver- 
aged over all galaxies in the survey, adopting a given faint 
apparent magnitude limit and assuming that the underlying 
spatial distribution of galaxies is homogeneous. In contrast, 
we can compute T v as a function of an incrementally in- 
creasing m\ (and/or indeed, if we wished, an incrementally 
decreasing bright limit, mjj m ) and thus analyse our data- 
set via a series of progressively truncated subsets. Crucially, 
moreover, like the T c test and unlike the V/Vmax statistic, 
T v has the important property of being independent of the 
spatial distribution of galaxies within the survey. 

Fig. [7] shows a comparison of the T v and T c curves for 
all three surveys. The left hand plot is the MGC survey with 
(k + e)-corrections applied. The T v curve shows an almost 
identical match to the T c statistic. Similar behaviour is evi- 
dent with SDSS-DR1 and 2dFGRS (middle and right plots). 
That T v and T c give a consistent indication of the complete- 
ness of these surveys should not be too surprising, since we 
are confident (at least once a bright limit is included in our 
analysis of 2dFGRS) that all three are well calibrated and 
well understood. Moreover, they are all relatively shallow in 
redshift range, which means that extinction and evolution 
corrections are not likely to impact too strongly on our as- 
sessment of their completeness (a fact which is supported by 
our results for T c ). However, one can ask under what condi- 
tions might the two statistics T c and T v diverge from each 
other? 

Consider a galaxy, i, characterised by its 'coordinates' 
(Mi,Zi). We have two complementary ways of generating 
volume limited data-sets for such a pair: 

• at fixed luminosity we can ask what redshift distribu- 
tion will produce apparent magnitudes permitted by our se- 
lection criteria? 

• alternatively, at fixed redshift we can ask what distribu- 
tion of luminosities (i.e. what part of the underlying galaxy 
luminosity function) are we sampling, given our selection 
limits in apparent magnitude? 

The former criterion resembles the procedure used to con- 
tract the T v statistic, while the latter criterion is more 
closely related to the procedure used to construct T c . This 
also implies that one might expect the two statistics to be- 
have differently when evolution becomes important - simply 



7 SUMMARY 

We have developed the completeness test statistic, T c , first 
introduced in Rauzy (2001), technique to account for the 
presence of both a faint and bright apparent magnitude limit 
in magnitude-redshift samples. We have applied it to the 
MGC, SDSS-DR1 and 2dFGRS surveys. Our results con- 
firm the completeness of data-sets such as SDSS-DR1 (early 
types only) where a faint and bright limit is well defined 
and published in the literature. Specifically, we have demon- 
strated that SDSS-DR1 is complete in apparent magnitude 
up to the published magnitude limit of m r = 17.45 mag 
indicating no residual systematics. Similarly, the magnitude 
completeness of the MGC survey has also been confirmed up 
to its published limit of m B . = 20.0 mag. Interestingly, how- 
ever, we found that when we incorporated (fc + e)-corrections 
to the MGC data-set, a noticeable (although not statisti- 
cally significant) departure from the expected value of the 
T c statistic - and indeed from the computed value of T c for 
the uncorrected data - was observed close to the magnitude 
limit. A possible cause for this effect could be the mixing of 
galaxy populations to which a global (k + e) is then applied 
- resulting in a slightly blurred magnitude limit. 

Our initial approach to the 2dF galaxy survey was to 
apply the original Rauzy test which accounts for a single, 
faint magnitude limit only. This was motivated by the cur- 
rent literature, in which only a faint limiting magnitude of 
m = 19.45 was well defined in the survey. However, our the 
application of our T c test reveal that the 2dFGRS is strongly 
inconsistent with being complete in apparent magnitude un- 
less a secondary bright limit (m = 13.6 in our subset) is 
included. 

Finally, we have also developed another variant on the 
original Rauzy T c completeness statistic, which we denote by 
T v , based on the cumulative distance distribution of galaxies 
in a magnitude-redshift survey. We find that T v has potential 
advantages over the widely used V/Vmax test: not least, the 
T v statistic retains the same properties as that of T c - i.e. 
is independent of the spatial distribution of galaxies within 
the survey. Furthermore, we have shown by example, that 
T v - when applied to the same well calibrated and relatively 
shallow survey samples as T c - produces almost identical 
results to that of the T c statistic. Our future work in this area 
will explore the application of T v and T c to deeper surveys, 
where evolution and fc-corrections become more important 
issues, to investigate the potential of these two statistics as 
diagnostics of luminosity and density evolution. 
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